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Abstract 

This paper studies estimation of panel cointegration models with cross-sectional dependence 
generated by unobserved global stochastic trends. The standard least squares estimator is, in 
general, inconsistent owing to the spuriousness induced by the unobservable 1(1) trends. We 
propose two iterative procedures that jointly estimate the slope parameters and the stochastic 
trends. The resulting estimators are referred to respectively as CupBC (continuously-updated 
and bias-corrected) and the CupFM (continuously-updated and fully-modified) estimators. We 
establish their consistency and derive their limiting distributions. Both are asymptotically 
unbiased and asymptotically normal and permit inference to be conducted using standard test 
statistics. The estimators are also valid when there are mixed stationary and non-stationary 
factors, as well as when the factors are all stationary. 



JEL Classification: C13; C33 

Keywords: Panel data; Common shocks; Co-movements; Cross-sectional dependence; Factor 
analysis; Fully modified estimator. 



*Department of Economics, New York University, New York, NY 10003, USA, and School of Economics and 
Management, Tsingiiua University. Email: Jushan.Bai@nyu.edu. 

^Center for Policy Research and Department of Economics, Syracuse University, Syracuse, NY 13244-1020, USA. 
Email: cdkao@maxwell.syr.edu. 

^Department of Economics, Columbia University, 440 W. 118 St. New York, NY 10027, USA, Email: Ser- 
ena. Ng@columbia.edu. 

We thank Yixiao Sun, Joon Park, Yoosoon Chang, and Joakim Westerlund for helpful discussions. We would also like 
to thank seminar and conference participants at NYU, Texas A&M, CESG 2006, Academia Sinica, National Taiwan 
University, Greater New York Area Econometrics Colloquium at Yale, and CIREQ Time Series Conference in Mon- 
treal for helpful comments. Bai and Ng gratefully acknowledge financial support from the NSF (grants SES-0551275 
and SES-0549978). 



1 Introduction 



This paper is concerned with estimating panel cointegration models using a large panel of data. Our 
focus is on estimating the slope parameters of the non-stationary regressors when the cross sections 
share common sources of non-stationary variation in the form of global stochastic trends. The 
standard least squares estimator is either inconsistent or has a slower convergence rate. We provide 
a framework for estimation and inference. We propose two iterative procedures that estimate the 
latent common trends (hereafter factors) and the slope parameters jointly. The estimators are ^JnT 
consistent and asymptotically mixed normal. As such, inference can be made using standard t and 
Wald tests. The estimators are also valid when some or all of the common factors are stationary, 
and when some of the observed regressors are stationary. 

Panel data have long been used to study and test economic hypotheses. Panel data bring 
in information from two dimensions to permit analysis that would otherwise be inefficient, if not 
impossible, with time series or cross-sectional data alone. A new development in recent years is 
the use of 'large dimensional panels', meaning that the sample size in the time series (T) and the 
cross-section (n) dimensions are both large. This is in contrast to traditional panels in which we 
have data of many units over a short time span, or of a few variables over a long horizon. Many 
researchers have come up with new ideas to exploit the rich information in large panels^ However, 
large panels also raise econometric issues of their own. In this analysis, we tackle two of these 
issues: the data {yit,Xit) are non stationary, and the structural errors en = yu — x'^^f] are neither 
iid across i nor over t. Instead, they are cross-sectionally dependent and strongly persistent and 
possibly non-stationary. In addition, en are also correlated with the explanatory variables xu. 
These problems are dealt with by putting a factor structure on eu and modelling the factor process 
explicitly. 

The presence of common sources of non-stationarity leads naturally to the concept of cointegra- 
tion. In a small panel made up of individually 1(1) (or unit root) processes yt and xt, where small 
means that the dimension of yt plus the dimension of xt is treated as fixed in asymptotic analysis, 
cointegration as defined in Engle and Granger (1986) means that there exists a cointegrating vector, 
(1 — /?'), such that the linear combinations yt — x[(3 are stationary, or are an 1(0) processes. In a 
panel data model specified by yu = x[^j3 + en where yu and xu are 1(1) processes, and that eu are 
iid across i, cointegration is said to hold if en are 'jointly' 1(0), or in other words, (1, —j3) is the 
common cointegrating vector between y^ and xn for all n units. A large literature panel cointe- 

^See, for example, Baltagi (2005), Hsiao (2003), Pesaran and Smith (1995), Kao (1999), and Moon and Phillips 
(2000, 2004) in the context of testing the unit root hypothesis using panel data. Stock and Watson (2002) suggest 
diffusion-index forecasting, while Bernanke and Boivin (2003) suggest new formulations of vector autoregressions to 
exploit the information in large panels. 
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gration already existqj for modelling panel cointegration when en is cross-sectionally independent. 
A serious drawback of panel cointegration models with cross-section independence is that there is 
no role for common shocks, which, in theory should be the underlying source of comovement in 
the cross-section units. Failure to account for common shocks can potentially invalidate estimation 
and inference of /sjfl In view of this, more recent work has allowed for cross-sectional dependence of 
Cit when testing for the null hypothesis of panel cointegrationlf] There is also a growing literature 
on panel unit root tests with cross-sectional dependence!^ 

In this paper, we consider estimation and inference of parameters in a panel model with cross- 
sectional dependence in the form of common stochastic trends. The framework we adopt is that 
Cit has a common component and a stationary idiosyncratic component. That is, en = A^-Fj -|- un, 
so that panel cointegration holds when uu = yu — Pxu — X'iFt is jointly stationary. A regression 
of Uit on Xit will give a consistent estimator for (3 when Ft is 1(0). We focus on estimation and 
inference about /3 when Ft is non-stationary. Empirical studies suggest the relevance of such a 
setup. Holly, Pesaran and Yamagata (2006) analyzed the relationship between real housing prices 
and real income at state level, allowing for unobservable common factors. They found evidence 
of cointegration after controlling for common factors and additional spatial correlations. Some 
economic models lead naturally to this set up. Consider a panel of industry data on output, factor 
inputs such as capital, and labor. Neoclassical production function suggests that log output yit is 
linear in log factor inputs xa and log productivity Cit- Decomposing the latent en into the industry 
wide component Ft and an industry specific component uu and assuming that Ft is the source of 
non-stationarity leads to a model with latent common trends. In such a case, a regression of yu on 
Xit is spurious since Cit is not only cross-sectionally correlated, but also non-stationary. 

We deal with the problem by treating the common 1(1) variables as parameters. These are 
estimated jointly with (3 using an iterated procedure. The procedure is shown to yield a consistent 
estimator of (3, but the estimator is asymptotically biased. We then construct two estimators to 
account for the bias arising from endogeneity and serial correlation so as to re-center the limiting 
distribution around zero. The first, denoted CupBC, estimates the asymptotic bias directly. The 

^See, for example, Phillips and Moon (1999) and Kao (1999). Recent surveys can be found in Baltagi and Kao 
(2000) and Breitung and Pesaran (2005). 

^Andrews (2005) showed that cross-section dependence induced by common shocks can yield inconsistent estimates. 
Andrews' argument is made in the context of a single cross section and for stationary regressors and errors. For a 
single cross section, not much can be done about common shocks. But for panel data, we can explore the common 
shocks to yield consistent procedures. 

''See, for example, Phillips and Sul (2003), Gengenbach et al. (2005b), and Westerlund (2006). 

^For example, Chang (2002,2004), Choi (2006), Moon and Perron (2004), Breitung and Das (2005), Gengenbach 
et al. (2005a), and Westerlund and Edgerton (2006). Breitung and Pesaran (2005) provide additional references in 
their survey. 
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second, denoted CupFM, modifies tiie data so that the hmiting distribution does not depend on 
nuisance parameters. Both are 'continuously updated' (Cup) procedures and require iteration till 
convergence. The estimators are ^/nT consistent for the common slope coefficient vector, (3. The 
estimators enable use of standard test statistics such as t, F, and for inference. The estimators 
are robust to mixed I(1)/I(0) factors, as well as mixed I(1)/I(0) regressors. Thus, our approach 
is an alternative to the solution proposed in Bai and Kao (2006) for stationary factors. As we 
argue below, the Cup estimators have some advantages that make an analysis of their properties 
interesting in its own right. 

The rest of the paper is organized as follows. Section 2 describes the basic model of panel coin- 
tegration with unobservable common stochastic trends. Section 3 develops the asymptotic theory 
for the continuously-updated and fully-modified estimators. Section 4 examines issues related to 
incidental trends, mixed I(0)/I(1) regressors and mixed I(0)/I(1) common shocks, and issues of 
testing cross-sectional independence. Section 5 presents Monte Carlo results to illustrate the finite 
sample properties of the proposed estimators. Section 6 provides a brief conclusion. The appendix 
contains the technical materials. 

2 The Model 

Consider the model 

Vit = x'it/3 + eu 
where for i = 1, n, t = 1, T, yit is a scalar, 

Xit = Xit-l + Eit. (1) 

is a set of k non-stationary regressors, /3 is a /c x 1 vector of the common slope parameters, and eu 
is the regression error. Suppose eu is stationary and iid across i. Then it is easy to show that the 
pooled least squares estimator of (3 defined by 

/ n T \ ^1 n T 

\j=l t=l / i=l t=l 

is, in general, T consistently] Similar to the case of time series regression considered by Phillips and 
Hansen (1990), the limiting distribution is shifted away from zero due to an asymptotic bias induced 
by the long run correlation between eu and eu- The exception is when xu is strictly exogenous, in 
which case the estimator is ^/nT consistent. The asymptotic bias can be estimated, and a panel 

®The estimator can be regarded as -v/nT consistent but with a bias of order 0{^/n). Up to the bias, the estimator 
is also asymptotically mixed normal. 
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fully-modified estimator can be developed along the lines of Phillips and Hansen (1990) to achieve 
yJnT consistency and asymptotic normality. 

The cross-section independence assumption is restrictive and difficult to justify when the data 
under investigation are economic time series. In view of comovements of economic variables and 
common shocks, we model the cross-section dependence by imposing a factor structure on en. That 
is, 



where is a r x 1 vector of latent common factors, Aj is a r x 1 vector of factor loadings and ua is 
the idiosyncratic error. If Ft and ua are both stationary, then en is also stationary. In this case, a 
consistent estimator of the regression coefficients can still be obtained even when the cross-section 
dependence is ignored, just like the fact that simultaneity bias is of second order in the fixed n 
cointegration framework. Using this property, Bai and Kao (2006) considered a two-step fully 
modified estimator (2sFM). In the first step, pooled OLS is used to obtain a consistent estimate 
of /3. The residuals are then used to construct a fully-modified (FM) estimator along the line of 
Phillips and Hansen (1990). Essentially, nuisance parameters induced by cross-section correlation 
are dealt with just like serial correlation by suitable estimation of the long-run covariance matrices. 

The 2sFM treats the 1(0) common shocks as part of the error processes. However, an alternative 
estimator can be developed by rewriting the regression model as 



Moving Ft from the error term to the regression function (treated as parameters) is desirable for 
the following reason. If some components of Xit are actually 1(0), treating Ft as part of error process 
will yield an inconsistent estimate for /3 when Ft and Xit are correlated. The simultaneity bias is 
now of the same order as the convergence rate of the coefficient estimates on the 1(0) regressors. 
Estimating /3 from ([3]) with F being 1(0) was suggested in Bai and Kao (2006), but its theory was 
not explored. 

When Ft is 1(1), which is the primary focus of this paper, there is an important difference 
between estimating /? from ([3]) versus pooled OLS in ([2]) because the latter is no longer valid. More 
precisely, if 



then eit is 1(1) and pooled OLS in ([2]) is, in general, not consistent. To see this, consider the 
following data generating process for Xit 



eit = A-Ft + Uit 



yu = x'itf3 + AjFj + Uit. 



(3) 



Ft = Ft-i + rit 



Xit = T'iFt+ ^i- 



it 



(4) 
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with being 1(1) such that = ^a^i + Cjt- For simphcity, assume there is a single factor. It 
follows that Xit is 1(1) and can be written as ([T]) with en = r'^rj^ + Qf-. The pooled OLS can be 
written as 

If Tj and Aj are correlated, or when they have non-zero means, the first term on the right hand side 
is Op(l), implying inconsistency of the pooled OLS. The best convergence rate is ^/n when xu and 
Ft are independent random walks. The problem arises because as seen from we now have a 
panel model with non-stationary regressors xu and Ft, and in which uu is stationary by assumption. 
This means that yu conintegrates with xu and Ft with cointegrating vector (1, — /?', Aj). Omitting 
Ft creates a spurious regression problem. It is worth noting that the cointegrating vector varies 
with i because the factor loading is unit specific. Estimation of the parameter of interest /3 involves 
a new methodology because F is unobservable. 

In the rest of the paper, we will show how to obtain ^/nT consistent and asymptotically normal 
estimates of P when the data generating process is characterized by ^ assuming that xu and 
Ft are both 1(1), and that xu, Ft and uu are potentially correlated. We will refer to Ft as the 
global stochastic trends since they are shared by each cross-sectional unit. Hereafter, we write 
the integral W{s)ds as f W when there is no ambiguity. We define O^/^ to be any matrix 
such that J7 = (il^/^) (il^/^) , and BM (fi) to denote Brownian motion with the covariance matrix 
Q. We use ||j4|| to denote {tr{A'A)y^'^, -^to denote convergence in distribution, to denote 
convergence in probability, [x] to denote the largest integer less than or equal to x. We let M < oo 
be a generic positive number, not depending on T or n. We also define the matrix that projects 
onto the orthogonal space of z as = It — z (z'z)^ z' . We will use F{^, and A° to denote 
the true common slope parameters, true common trends, and the true factor loading coefficients. 
Denote (n,T) ^ oo as the joint limit. Denote (n,T)^gg — > oo as the sequential limit, i.e., T ^ oo 
first and n ^ oo later. We use MN{0, V) to denote a mixed normal distribution with variance V. 

Our analysis is based on the following assumptions. 

Assumption 1 Factors and Loadings: 
(a) E llAj*!!^ < M. As n ^ oo, ^ X^"=i A^'A-' — ^ Ti\, a r x r diagonal matrix. 

(h) E 11% < M for some 5>Q and for all t. As T ^oo, ^ J27=i Pt^t' I BrjB'^, arxr 
random matrix, where is a vector of Brownian motions with covariance matrix Qr], which 
is a positive definite matrix. 
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Assumption 2 Let wu = (uu, e[^,r][^ . For each i, wu = Ui{L)vit = Yl'jLo^ij'^it-j where vu 
is i.i.d. over t, Yl^oJ'^ W'^ijW — /^'^ some a > 1, and |nj(l)| > c > for all i. In addition, 
Evit = 0, E{vitv[f) = Sj, > 0, and E\\vit\\^ < M < oo. 



Assumption 3 Weak cross-sectional correlation and heterokedasticity 

(a) E{uitUjs) = CTij^ts, Wij,ts\ < ^ij for all {t,s) and \crij^ts\ < rts for all {i,j) such that (i) 
71 Ey=i ^ij < ^> (i^) T Em=i ^ts < M, and (Hi) J2i,j,t,s=i WijM < M. 

4 

(b) For every {t, s) , E Y17=i [^isUit - E (uisUu)] < M. 

(c) J2t,s,u,v \cov {uitUis,UjuUj^)\ <M and Y.t,s T.i,j,k,l l^ow iuitUjs,UkuUis)\ < M. 

Assumption 4 are not cointegrated. 

Assumption [1] is standard in tlie panel factor literature. Assumption [3] allows for limited time 
series and cross-sectional dependence in the error term, ua- Heteroskedasticity in both time series 
and cross-sectional dimensions for uu is allowed as well. The assumption that fi^ is positive definite 
rules out cointegration among the components of F^. Assumption U] also rules out the cointegration 
between xu and Fj'. 

Assumption [2] implies that a multivariate invariance principle for wu holds, i.e., the partial sum 
process Ylt=i '^a satisfies: 



t=i 



B, 



B (Qi) as T ^ oo for all i, 



where 



Bi - [ Bui B.^ 
The long-run covariance matrix of {wa} is given by 



WiOWij 



j=-oo 



(5) 



are partitioned conformably with wu- Define the one-sided long-run covariance 



j=0 



A 

A 

A, 



eui Agj Ag^j 



rjui ^rjei 



(6) 
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Bbi — 


[ K^ 

L t i 


B 

'I 


Then Bi can be rewritten as 






Bi = 


Bui 

Bbi 







' Vi 




Wi 



For future reference, it will be convenient to group elements corresponding to en and r]^ taken 
together. Let 

' Q ^ei ^erii 

where [ Vi W- ] = BM (/) is a standardized Brownian motion and 

^u.bi ^ui ^ubi^bi ^bui 

is the long-run conditional variance of un given (Ax^^, A F^')' . Note that Qbi > since we assume 
that there is no cointegration relationship in {x[^, F^')' in Assumption HI 

Finally, we state an additional assumption, which is needed when deriving the limiting distri- 
bution but is not needed for consistency of the proposed estimators. 

Assumption 5 The idiosyncratic errors un are cross-sectionally independent. 

3 Estimation 

In this section, we first consider the problem of estimating (3 when F is observed. We then consider 
two iterative procedures that jointly estimate /? and F. The procedures yield two estimators that 
are ^/nT consistent and asymptotically normal. These estimators, denoted CupBC and CupFM, 
are presented in subsections 3.2 and 3.3. 

3.1 Estimation when F is observed 

The true model ([3]) in vector form, is 

Vi = x./jo + F^AO + Ui 

where 





yn 




^il 












yi2 




t 


,F = 


F2 






Vi = 




1 ^i — 




,Ui = 










t 

- ^iT - 




.K. 







Define A = (Ai, .., A„) to be an a n x r matrix. In matrix notation 



y = Xp° + F°A°' + u. 
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Given data y, x, and F'^, the least squares objective function is 

n 

1=1 

After concentrating out A, the least squares estimator for (3 is then 

/ n \ ~1 n 



Ki=l 



1=1 







The least squares estimator has the following properties 



Proposition 1 Under Assumptions 1-5, as {n, T)^^^ oo 



where 



(p^s - /?°) - V^4>It ^ MN (0, 



1 " r 1 



i=l 



i=l 



and with C being the a-field generated by {Ft} , 



limiVl^„.M^f / Q,Q-\C 
i=i 



D = Urn -Y E ( [ QiQ',\c] 



Qi 
5? 



B^B^ 



-1 



B. 



^x'.MpoAb^n^lnbui + (a+ , - <"a+ 

(^o'^O)-i^o'^.^ A6i = ( Axi AFO ) 



a: 



'r/u 



^bui ^bi 



h 

bi ^^bui 



A 



bui 



^bi^lfl ^bui- 



(7) 
(8) 



The estimator is \/nT consistent if (p^rp = 0, which occurs when xu is strictly exogenous. 
Otherwise, the estimator is T consistent as there is an asymptotic bias given by the term ^/n(j)^J'. 
This is an average of individual biases that are data specific as seen from the definition of 9^. 
The individual biases arise from the contemporaneous and low frequency correlations between the 
regression error and the innovations of the 1(1) regressors as given by terms such as Vtbui and A^uj. 



'^The limiting distribution for F being 1(0) can also be obtained. Park and Phillips (1988) provide the limiting 
theory with mixed 1(1) and 1(0) regressors in a single equation framework. 
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To estimate the bias, we need to consistently estimate the nuisance parameters. We use a kernel 
estimator. Let 

T-l 



j=T+l ^ ^ 

'J-' ]^ 



i=o 

T-j 



^ 1 / 



T 

t=l 

where wu = {uit, Ax'^^, AF^')' . To state the asymptotic theory for the bias-corrected estimator, we 
need the following assumption, as used in Moon and Perron (2004): 

Assumption 6 (a) liminf (log T/ log n) > 1. 

n,T—» oo 



(b) the kernel function a; (•) : R [—1, 1] satisfies (i) u) (0) = 1, u {x) = to (— x) , (ii) J^^ iv (x)^ dx < 
oo and with Parzen's exponent q G (0, oo) such that lim ^^^^^f^ < oo. 

(c) The bandwidth parameter K satisfies K ^ and ^ < b < liminf — 1. 
Let 

1 " / 



1=1 



-1 



where & = ^ J2i=i a consistent estimate of 9^. The resulting bias-corrected estimator is 

Plsbc = ~Pls - ^^nT- (9) 
This estimator can alternatively be written as 

x[Mpox, {^'i^Foyt - T (a+ , - SfA+,)) (10) 

i=l / i=l 

where y"*" and A+ are consistent estimates of and A+ etc, with 

+ _ o o-i f ^ + - o o-i f ^ 

Viewed in this light, the bias-corrected estimator is also a panel fully- modified estimator in the spirit 
of Phillips and Hansen (1990), and is the reason why the estimator is also labeled Plsfm- It is not 
difficult to verify that Plsbc Plsfm identical. Panel fully modified estimators were also 
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considered by Phillips and Moon (1999) and Bai and Kao (2006). Here, we extend those analysis 
to allow for common stochastic trends. By construction uf^ has a zero long-run covariance with 
( Ax'-^ AFj^' )' and hence the endogeneity can be removed. Furthermore, nuisance parameters 
arising from the low frequency correlation of the errors are summarized in A^^. 

Proposition 2 Let Plsfm be defined by fW\) . Under Assumptions 1-6, as {n^T)^^^ — > oo 

V^tCPlsfm - ^ MN (0, TP) . 

In small scale cointegrated systems, cointegrated vectors are T consistent, and this fast rate of 
convergence is already accelerated relative to the case of stationary regressions, which is ^/T. Here 
in a panel data context with observed global stochastic trends, the estimates converge to the true 
values at an even faster rate of ^/nT and the limiting distributions are normal. To take advantage 
of this fast convergence rate made possible by large panels, we need to deal with the fact that 
is not observed. This problem is considered in the next two subsections. 

3.2 Unobserved F^ and the Cup Estimator 

The LSFM considered above is a linear estimator and can be obtained if F^ is observed. When 
F^ is not observed, the previous estimator is infeasible. Recall that least squares estimator that 
ignores F is, in general, inconsistent. In this section, we consider estimating F along with (3 and A 
by minimizing the objective function 

n 

SnT (/?, F, A) = ^ (y - Xif3 - FXi)' (y - Xi(3 - FXi) (11) 

i=l 

subject to the constraint T~^F F = Ir and A A is positive definite. The least squares estimator 
for /? for a given F is 



P=[Y^ x'iMpXi ) x'iMpyi. 

Define 



Wi = yi- Xif3 
= FXi + Ui. 

Notice that given /3, Wi has a pure factor structure. Let W = {wi, ...,Wn) be a T x n matrix. 
We can rewrite the objective function (jlip as ir[(W^ — FJ\^)(W — FA')']. If we concentrate out 
A = W'F (f'F^ = T-'^W'F, we have the concentrated objective function: 

tr (w'MfW^ = tr (w'W^ - tr (^f'ww'f/T'^^ . (12) 
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Since the first term does not depend on F, minimizing ()12p with respect to F is equivalent to 



maximizing tr yT F WW Fj subject to the constraint T F'F = Ij.. The solution, denoted F, is 
a matrix of the first r eigenvectors (multiplied by T) of the matrix Y17=i iVi ~ ^iP) iVi ~ ^iP) ■ 
Although F is not observed when estimating /3, and similarly, /? is not observed when estimating 
F, we can replace the unobserved quantities by initial estimates and iterate until convergence. Such 
a solution is more easily seen if we rewrite the left hand side of (|12p with y — x/5 substituting in for 
W. Define 

1 " 

SnT iP, F) = ^yZiVi- Xi(3)' Mf {Vi - XiP) . 



i=l 



The continuous updated estimator (Cup) for (/3, F) is defined as 

Pcup^^Cup ^ 

More precisely, {Pcup^ ^Cup) is the solution to the following two nonlinear equations 



argmin SnT {f3, F) 

I3,F 



FVnT 



vi=l 



2 = 1 



i=l 



(13) 
(14) 



where Mp = It — T~^FF' since F F/T'^ = Ir, and VnT is a diagonal matrix consisting of the r 
largest eigenvalues of the matrix inside the brackets, arranged in decreasing order. Note that the 
estimator is obtained by iteratively solving for (3 and F using (jl3p and (jl4p . It is a non-linear 
estimator even though linear least squares estimation is involved at each iteration. An estimate of 
A can be obtained as: 

A = T-'^f' (y - XP^ . 

The triplet ^/3,F, A^ jointly minimizes the objective function pip . 

The estimator Pcup is consistent for /3. We state this result in the following proposition. 

Proposition 3 Under Assumptions [Ij^ and as {n,T) oo, 

Pcup 

We now turn to the asymptotic representation of pQ^p. 
Proposition 4 Suppose AssumptionslM^hold and {n,T) oo. Then 



'Cup 



D{F'>) 



1 " / / 1 " / \ 

^ ix'iMpo --Yj aikx'iMpoj Ui 



+ Op (1) , 
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where an. = A'j 



A A 



Afc, D = Yh=i Z[Zi and Zi = MpoXi - ^ I]fc=i MpoXkaik- 



In comparison with the pooled least squares estimator for the case of known , estimation of 
the stochastic trends clearly affects the limiting behavior of the estimator. The term involving aik 
is due to the estimation of F. This effect is carried over to the limiting distribution and to the 
asymptotic bias, as we now proceed to show. Let wu = {uu, Ax'j, !][)' where Xi = rcj — ^ Ya:=i ^k^-ik- 
For the rest of the paper, we use bar to denote those long run covariance matrices (including one 
sided and conditional covariances and so on) generated from wu instead of wu. Thus, is the 
long run covariance matrix of wn as in ([5]), and define Aj is the one-sided covariance matrix of wu- 
These quantities depend on n, but this dependence is suppressed for notional simplicity. 

Because the right hand side of the representation does not depend on estimated quantities, it is 
not difficult to derive the limiting distribution of Pcup^ even allowing for cross-sectional correlation 
in Uit- However, estimating the resulting nuisance parameters would be more difficult. Thus, 
although consistency of the Cup estimator does not require the cross-section independence of uu, 
our asymptotic distribution for Pcup is derived with Assumption [5] imposed. 

Theorem 1 Suppose that Assumptions[l\-5 hold. Let Pcup obtained by iteratively updating I^13\) 
and [14^ . As {n,T)g^^ — > 00, we have 



V^T (Pc^p - /?) - ^ MN (0, S) 



where 



n 



-. n ^ ^ n 



i=l 



i=l 



1 ^' 



hm -y2^uME ( [ RniR'nilc) Dz= lim i Vi^ f / RmR'm\c) , (15) 

n-^oon ^—^ \J J n^oon \J J 

1 " 

Rni — Qi / J Qk 



!k^ik: 



k=l 



Abi = ( Axi AFO ) , 



1 " 

-E 



Xi — Xi 7 Xf^ai]^ 

n 

k=l 

n 



^k^ik- 

k=l 
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Theorem 1 establishes the large sample properties of the Cup estimator. The Cup estimator 
is ^JnT consistent provided that = 0, which occurs when xa and Ft are exogenous. Since 
4'nT — '^p(l)) the Cup estimator is at least T consistent. This is in contrast with pooled OLS in 
Section 2, where it was shown to be inconsistent in general. Nevertheless, as in the case when F 
is observed, the Cup estimator has an asymptotic bias and thus the limiting distribution is not 
centered around zero. There is an extra bias term (the term involving aik) that arises from having 
to estimate Ff. In consequence, the bias is now a function of terms not present in Proposition 
1, which is valid when Ft is observed. We now consider removing the bias by constrTicting a 
consistent estimate of (j^nT- This can be obtained upon replacing F^ , A6j, J^^i, ^bui-, ^tuv 
their consistent estimates. 

We consider two fully-modified estimators. The first one directly corrects the bias of Pcup^ 
is denoted by PcupBC- The second one will be considered in the next subsection, where correction 
is made during each iteration, and will be denoted by PcupFM- 



Consider 






T-j 



t=l 



where 




k=l 



The bias-corrected Cup estimator is defined as 



PcupBC — Pcup - 7f;(PnT 



where 




1 -1 



1=1 



1=1 




k=l 
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Theorem 2 Suppose Assumptions[ME hold. Then as {n,T)^^^ — > oo, 

V^T (dcupBC - /?°) ^ MN (0, S) . 



The CupBC is y/nT consistent with a hmiting distribution that is centered at zero. This type 
of bias correction approach is also used in Hahn and Kuersteiner (2002), for example, and is not 
uncommon in panel data analysis. Because the bias-corrected estimator is y/nT and has a normal 
limit distribution, the usual t and Wald tests can be used for inference. Note that the limiting 
distribution is different from that of the infeasible LSBC estimator, which coincides with LSFM 
and whose asymptotic variance is S'^ instead of S. Thus, the estimation of F affects the asymptotic 
distribution of the estimator. As in the case when F is observed, the bias corrected estimator can 
be rewritten as a fully modified estimator. Such a fully-modified estimator is now discussed. 

3.3 A Fully Modified Cup Estimator 

The CupBC just considered is constructed by estimating the asymptotic bias of Pcup^ then 
subtracting it from Pc^p- In this subsection, we consider a different fully- modified estimator, 
denoted by PcupFM- Let 

-1 / 



Vit = Vit - ^ubi^bi 



~5i = i'F'F] ^F'% 



AFt 



where Cluti, ^bi, and Abut are estimates of Clubi^^bi and Ahui, respectively. Recall that Pcup is 
obtained by jointly solving (fTH|) and (fT^ . Consider replacing these equations by the following: 

dcupFM = (jZ^'i^F^^ JZl^'^iMpVt -T(^Atui-%Ku^^ (16) 

n 



FVnT 



^^2^2 ^ ~ ^if^CupFM^ {vi ~ ^iPcupFM 
i=l 



F. (17) 



Like the FM estimator of Phillips and Hansen (1990), the corrections are made to the data to remove 
serial correlation and endogeneity. The CupFM estimator for (/3, F) is obtained by iteratively 
solving (jl6p and (jl7p . Thus correction to endogeneity and serial correlation is made during each 
iteration. 

Theorem 3 Suppose As sumptions [J\- 6 hold. Then as {n,T)^^^ oo, 

V^T (pc^pFM - ^ MN (0, S) , 

where S is given in (Tl 
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The CupFM and CupBC have the same asymptotic distribution, but they are constructed 
differently. The estimator PcupBC does the bias correction only once, i.e., at the final stage of 
the iteration, and (^cupFM does the correction at every iteration. The situation is different from 
the case of known F, in which the bias-corrected estimator and the fully-modified estimator are 
identical due to the absence of iteration. Again, because of the mixture of normality, hypothesis 
testing on /3 can proceed with the usual t or chi square distributions. 

Kapetanios, Pesaran, and Yamagata (2006) suggest an alternative estimation procedure based 
on Pesaran (2006). The model is augmented with additional regressors yt and xt, which are cross- 
sectional averages of ya and xu- These averages are used as proxy for Ft. The estimator for the 
slope parameter (5 is shown to be ^/n consistent, but fully modified estimator is not considered. 

While the focus is on estimating the slope parameters /5, the global stochastic trends F are also 
of interest. We state this result as a proposition: 

Proposition 5 Let F he the solution of |77| ). Under Assumptions of 1-4, we have 

T 

t=i 

where H is an r ^ r invertihle matrix. 

Thus, we can estimate the true global stochastic trends up to a rotation. This is the same rate 
as in Bai (2004, Lemma B.l), where the regressor xa is absent. Similarly, the factor loadings Aj 
are estimated with the same rate of convergence as in Bai (2004). 

Thus far, our analysis assumes that the number of stochastic trends, r, is known. If this is not 
the case, r can be consistently estimated using the information criterion function developed in Bai 
and Ng (2002). In particular, let 

r = arg min IC (r) 

l<r<rmax 

where r < rmax, '"max is a bounded integer and 

IC{r) = log 5^ (r) + rgnT 

where gnT —> as n,T —> oo and min[n, r]^!^^- oo. For example, QnT can be log(a„T)/a„T, with 
^nT = :^f- Then P{r = r) ^ 1 as n,T ^ oo. This criterion estimates the total number of factors, 
including 1(0) factors. To estimate the number of 1(1) factors only, the criterion in Bai (2004) can 
be used. Ignoring the 1(0) factors still lead to consistent estimation of /?. However, our distribution 
theory assumes cross-sectional independence for the idiosyncratic errors; lumping 1(0) factors with 
the regression errors will violate this assumption. This suggests the use of Bai-Ng criterion. 
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4 Further issues 

The preceding analysis assumes that there are no deterministic components and that the regressors 
and the common factors are all 1(1) without drifts. This section considers construction of the 
estimator when these restrictions are relaxed. It will be shown that when there are deterministic 
components, we can apply the same estimation procedure to the demeaned or detrended series, 
and the Brownian motion processes in the limiting distribution are replaced by the demeaned 
and/or detrended versions. Furthermore, the procedure is robust to the presence of mixed I(1)/I(0) 
regressors and/or factors. Of course, the convergence rates for 1(0) and 1(1) regressors will be 
different, but asymptotic mixed normality and the construction of test statistics (and their limiting 
distribution) do not depend on the convergence rate. Finally, we also discuss the issue of testing 
cross-sectional independence. 

4.1 Incidental trends 

The Cup estimator can be easily extended to models with incidental trends, 

yit = ai + Pit + x[tl3 + >!iFt + Uit. (18) 
In the intercept only case (pj = 0, for all i), we define the projection matrix 

Mt = It- iTir/T 

where lt is a vector of I's. When a linear trend is also included in the estimation, we define Mt 
to be the projection matrix orthogonal to lt and to the linear trend. Then 

MrVi = Mrxip + MrFXi + Mtuu 

or 

yi = XiP + FfXi + Ui 

where the dotted variables are demeaned and/or detrended versions. The estimation procedure for 
the cup estimator is identical to that of Section 3, except that we use dotted variables. 

With the intercept only case, the construction of FM estimator is also the same as before. 
Theorems 1-3 hold with the following modification for the limiting distribution. The random 
processes B^^i and -B^ in Qi are replaced by the demeaned Brownian motions. 

When linear trends are allowed, Axit is now replaced by in = Axu — Ax^, which is detrended 
residual of xu- But since Xj is already a detrended series, and F is also asymptotically detrended 
(since it is estimating F), Axu and AFt are also estimating the detrended residuals. Thus we can 
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simply apply the same procedure prescribed in Section 3 with the dotted variables. The limiting 
distribution in Theorem 2 and consequently in Theorem 3 is modified upon replacing the random 
processes B^i and B^ by the demeaned and detrended Brownian motionslf] The test statistics {t 
and x^) have standard asymptotic distribution, not depending on whether the underlying Brownian 
motion is demeaned or detrended. 

When linear trends are included in the estimation, the limiting distribution is invariant to 
whether or not yit, xu and Ft contain a linear trend. Now suppose that these variables do contain 
a linear trend (drifted random walks). With deterministic cointegration holding (i.e., cointegrating 
vector eliminates the trends), the estimated [5 will have a faster convergence rate when a separate 
linear trend is not included in the estimation. But we do not consider this case. Interested readers 
are referred to Hansen (1992). 

4.2 Mixed I(0)/I(1) Regressors and Common Shocks 

So far, we have considered estimation of panel cointegration models when all the regressors and 
common shocks are 1(1). There are no stationary regressors or stationary common shocks. In this 
section we suggest that the results are robust to mixed I(1)/I(0) regressors and mixed I(1)/I(0) 
common shocks. Below, we sketch the arguments for the LS estimator assuming the factors are 
observed. If they are not observed, the limiting distribution is different, but the idea of argument 
is the same. 

Recall that the LS estimator is = (X^ILi x[MpoXi)~^ l^r=i ^'i^F^Ui- The term 

Mpoxi = {It - fO(fO'fO)-1fO')x, =Xi- F^6i 

with 5i = (F^' F^)~^ F^' Xi plays an important role in the properties of the LS. When xn and Ft are 
1(1), 5i = Op{l) and thus 

(M^oxQt _ Xit 5',F^ _ ^ 
We now consider this term under mixed 1(1) and 1(0) assumptions. 

1(1) Regressors, 1(0) Factors. Suppose all regressors are 1(1) and all common shocks are 1(0). 
With 1(0) factors, we have T'^F^'F^ ^ = Op (1). Thus 

d.= {T-'F^'FY^Y.^tx'u^^F' [ dB,B:, = 0,{l). 

^Alternatively, we can use in — ^ Yllt=i ^kto-ik in place of lS.Xit in Section 3. Similarly, we use j)^ = A_Ft — AF in 
place of AFt. 
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It follows that 



{Mpox 



it 



Xit 



Xit 



+ Op (1) 



and % 
Vt 



Vt Vt Vt 

B^i as T ^ cxD. The limiting distribution of the LS when the factors are 1(0) is the 



same as when all factors are 1(1), except that Qi is now asymptotically the same as B^i. For the 



FM, observe that the submatrix 17^ in 



n 



bi 



a 



rjei 



erji 



is a zero matrix since t] = AFj^ is an /(—I) process and has zero long-run variance. Similarly, 



flsrii is also zero. The submatrix ^u-qi in ^u.bi 



a, 



^ubi^bi ^bui as well as the submatrices 



( Arjui A^j ) in ( Ahui Af,j ) are also degenerate because the factors are 1(0). Note that Qbi 
is not invertible. Under appropriate choice of bandwidth, see Phillips (1995), i^'i^-^flbui can be 
consistently estimated, so that FM estimators can be constructed. This argument treats Ft as if it 
were 1(1). If it is known that Ft is 1(0), we will simply use Ft instead of AFt in the FM construction. 



1(1) Regressors, Mixed I(0)/I(1) Factors Consider the model 

yu = x'it/3 + X'uFit + X'2iF2t + Uit 



(19) 



where -Fit = Vit is ri x 1 and AF2t = f/2t is ^"2 x 1- We again have MpoXi = Xi — F^5i but 
Si = [ Sii 62i ]' ■ Then 

{MpoXi)t 



Vt 



hi hi 



Xit 1_ 

^ Vt 

'''' ^'^^^'^ +Op{l) 



F2% 



Xit 



Vt 



since 6ii = Op (1), 62i = Op (1) but = Op(l). The random matrix Qi involves Bg^i and -B2r?- In 
the FM correction, the long run variance {uu, Ax'^^, AF{^, AFgJ' is degenerate. With an appropriate 
choice of bandwidth as in Phillips (1995), the limiting normality still holds. 



Mixed I(1)/I(0) Regressors and 1(1) Factors Suppose k2 regressors denoted by X2it are 1(1) 
and ki regressors denoted by xut are 1(0). Assume Ft is 1(1) and uu is 1(0) as in ([3]). Consider 

Vit = ai + x'mPi + X2it/?2 + A-Ft + uu 

Ax2it = £2it- 
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With the inclusion of an intercept, there is no loss of generality to assume xm having a zero mean. 
For this model, we add the assumption that 

E{xumt) = (20) 

to rule out simultaneity bias with 1(0) regressors. Otherwise Pi cannot be consistently estimated. 
Alternatively, if Uit is correlated with xnt, we can project uu onto xut to obtain the projection 
residual and still denote it by Uit (with abuse of notation), and by definition, uu is uncorrelated 
with xiit- But then (3i is no longer the structural parameter. The dynamic least squares approach 
by adding Ax2it is exactly based on this argument, with the purpose of more efficient estimation 

of 02- 

If one knows which variable is 1(0) and which is 1(1), the situation is very simple. The 1(1) 
and 1(0) variables are asymptotically orthogonal, we can separately analyze the distribution of the 
estimated Pi and /?2- The estimated Pi needs no correction and is asymptotically normal, and the 
estimated P2 ^ distribution as if there is no 1(0) regressors except the intercept. Note that the 
FM construction for P2 is based on the residuals with all regressors included. The rest of analysis 
is identical to the situation of all 1(1) regressors with an intercept. 

In practice, the separation of 1(0) or 1(1) regressors may not be known in advance. One can 
proceed by pretesting to identify the integration order for each variable, and then apply the above 
argument. One major purpose of separating 1(0) and 1(1) variables is to derive relevant rate of 
convergence for the estimated parameters. But if the ultimate purpose is to do hypothesis testing, 
there is no need to know the rate of convergence for the estimator since the scaling factor n or T are 
cancelled out in the end. One can proceed as if all regressors are 1(1). Then care should be taken 
since the long-run covariance matrix is of deficient rank. Phillips (1995) shows that FM estimators 
can be constructed with appropriate choice of bandwidth. Interested readers are referred to Phillips 
(1995) for details. 

Finally, there is the case of mixed I(1)/I(0) regressors and mixed I(1)/I(0) factors. As explained 
earlier, 1(0) factors do not change the result. In practice, there is no need to know whether F° 
is 1(1) and 1(0), since the Cup estimator only depends on M^; scaling in F does not alter the 
numerical value of $cup- 

5 Monte Carlo Simulations 

In this section, we conduct Monte Carlo experiments to assess the finite sample properties of 
the proposed CupBC and CupFM estimators. We also compare the performance of the proposed 
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estimators with that of LSDV (least squares dummy variables, i.e., the within group estimator) 
and 2sFM (2-stage fully modified which is the CupFM estimator with only one iteration). 
Data are generated based on the following design. For i = 1, ...,n, t = 1, ...,r. 



Vit 
Ft 

Xit 



where ^ 



/ Uit ^ 




i 


' " 




Sit 









\ rit j 












Ixit + c \\iFt ) + Uit 
Ft-i + Vt 
Xit-i + £it 



1 cri2 cri3 
0'21 1 0-23 
fai 0-32 1 



(21) 



We assume a single factor, i.e., r = 1, Aj and r/^ are generated from i.i.d. A^(/^A) 1) -^(A*r?) 1) 
respectively. We set /i;^ = 2 and fj,^ = 0. Endogeneity in the system is controlled by only two 
parameters, a2i and a^i. The parameter c controls the importance of the global stochastic trends. 
We consider c = (5, 10) , CJ32 = 0.4, a2i = (0, 0.2, -0.2) and CJ31 = (0, 0.8, -0.8) . 

The long-run covariance matrix is estimated using the KERNEL procedure in COINT 2.0. We 
use the Bartlett window with the truncation set at five. Results for other kernels, such as Parzen 
and quadratic spectral kernels, are similar and hence not reported. The maximum number of the 
iteration for CupBC and CupFM estimators is set to 20. 

Table 1 reports the means and standard deviations (in parentheses) of the estimators for sample 
sizes T = n = (20, 40, 60, 120) . The results are based on 10, 000 replications. The bias of the LSDV 
estimator does not decrease as (n, T) increases in general. In terms of mean bias, the CupBC and 
CupFM are distinctly superior to the LSDV and 2sFM estimators for all cases considered. The 
2sFM estimator is less efficient than the CupBC and CupFM estimators, as seen by the larger 
standard deviations. 

To see how the properties of the estimator vary with n and T, Table 2 considers 16 different 
combinations for n and T, each ranging from 20 to 120. From Table 2, we see that the LSDV and 
2sFM estimators become heavily biased when the importance of the common shock is magnified as 
we increase c from 5 to 10. On the other hand, the CupBC and CupFM estimators are unaffected 
by the values of c. The results in Table 2 again indicate that the CupBC and CupFM perform well. 

The properties of the t-statistic for testing 13 = Pq, are given in Table 3. Here, the LSDV 
t-statistic is the conventional t-statistic as reported by standard statistical packages. It is clear 



^Random numbers for error terms, tJj) are generated by the GAUSS procedure RNDNS. At each rephca- 

tion, we generate an nT length of random numbers and then split it into n series so that each series has the same 
mean and variance. 
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that LSDV i-statistics and 2sFM t-statistics diverge as (n, T) increases and they are not well 
approximated by a standard N(0,1) distribution. The CupBC and CupFM t-statistics are much 
better approximated by a standard N(0,1). Interesting, the performance of CupBC is no worse 
than that of CupFM, even though CupBC does the full modification in the final stage of iteration. 

Table 4 shows that, as n and T increases, the biases for the t-statistics associated with LSDV 
and 2sFM do not decrease. For CupBC and CupFM, the biases for the t-statistics become smaller 
(except for a small number of cases) as T increases for each fixed n. As n increases, no improvement 
in bias is found. The large standard deviations in the i-statistics associated with LSDV and 
2sFM indicate their poor performance, especially as T increases. For the CupBC and CupFM, the 
standard errors converge to 1.0 as n and T (especially as T) increase. 

6 Conclusion 

This paper develops an asymptotic theory for a panel cointegration model with unobservable global 
stochastic trends. Standard least squares estimator is, in general, inconsistent. In contrast, the 
proposed Cup estimator is shown to be consistent (at least T-consistent) . In the absence of en- 
dogeneity, the Cup estimator is also s/nT consistent. Because we allow the regressors and the 
unobservable trends to be endogenous, an asymptotic bias exists for the Cup estimator. We further 
consider two bias-corrected estimators, CupBC and CupFM, and derive their rate of convergence 
and their limiting distributions. We show that these estimators are s/nT consistent and this holds 
in spite of endogeneity and in spite of spuriousness induced by unobservable 1(1) common shocks. A 
simulation study shows that the proposed CupBC and CupFM estimators have good finite sample 
properties. 
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Appendix 



Throughout we use {n,T)^^g — > oo to denote the sequential hmit, i.e., T — > oo first and followed 
by n ^ oo. We use MN{0, V) to denote a mixed normal distribution with variance V. Let C be 
the (T-field generated by {Fj^} ■ The first lemma assumes Ui is uncorrelated with {ui,F^) for every 
i. This assumption is relaxed in Lemma lA.2[ 

Lemma A.l Suppose that Assumptions 1-5 hold and that ui is uncorrelated with {xi,F^), then as 
(a) 

1=1 i=l ^ 

\x,Mp.n, MN (^0, Jim ^f^n^.E {^j Q.QIIc)^ . 

Proof. Note that 

I I , l" l^ , 

- E T^^'i^F^Xi = - E l^XiMpoMpoXi = - E ^ E 
i=l 1=1 i=l t=l 

where xu = xu — S^F^ and 



see, e.g., Phillips and Ouliaris (1990). Thus 

Xjt _ Xit_ _ r' Ff_ d 

Vt~Vt 'Vt^ 



Bn — Qi 



By the continuous mapping theorem 

T 

J^2 



1 

^ XitXit > / QiQi = Cli 



t=l 

as T ^ OO. The variable Cij is independent across i conditional on C, which is an invariant cj-field. 
Thus conditioning on C, the law of large numbers for independent random variables gives, 

n n n . . 

-Y.^l^^ 1™ - Y.^{(:u\c)= lim - Y,e( q,q[\c). 

n ^ — ' n^oo n ^ — ^ n^oo n ^ — ^ \ / / 

1=1 i=l i=l ^ 
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Thus, the sequential hmit is 



-f^;^x;M^ox,^ hm -Y^eI j Q^Q\\C] 
1=1 i=l ^-^ ^ 



This proves part (a). 

Consider (b). Rewrite 



i=l i=l t=l 



where xa = xu — ^iF^ as before. By assumption, uu is 1(0) and is uncorrelated with xa- It fohows 
that ^ 



where Z ^ N {0,lk) as T ^ oo for a fixed n. The variable is independent across i conditional 
on C, which is an invariant cr-field. Thus conditioning on C, 

- E ^2^4 hrn-Y^E U2iQc) = hm - E / E (q^Q[\c) . (22) 

i=\ i=l i=l •' 

Let li be the a field generated by {F^*} and (C2I) ■••) '?2j)- Then {C2iili',i > 1} is a martingale 
difference sequence (MDS) because {^2i} independent across i conditional on C and 

From Y17=i ^2i^2i = Op (n), the conditional Lindeberg condition in Corollary 3.1 of Hall and Heyde 
(1980) can be written as 

- E ^ (^2.41 (11^2.11 > ^ (23) 

i=l 

for all 5 > 0. To see (1231). notice that 



^ n 1 

- (^2.41 (11^2^11 > v^^) = - E^ (^2.41 (11^2^11 > v^s) \c 

1=1 i=l 

Without loss of generality we assume ^ scalar to save notations. By the Cauchy-Schwarz 

inequality 

E (4l (11^2.11 > \C) < {E iCilC)}'^' {E [I (11^2.11 > V^6) \C] . 
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Furthermore, 



It follows that 



E[im\\>v^s)\c]<^^i 



1 " 1 r 1 

1=1 L j=i 



1/2 



0,(n-V2) 



in view of 



1 " 



i=l 



This proves ()23p . The central limit theorem for martingale difference sequence, e.g.. Corollary 3.1 
of Hall and Heyde (1980), implies that 

1/2 



V i=l L i=l 



X Z 



(24) 



where Z (0, 1) and Z is independent of lim„^oo ^ Z^^Li (^^2j'?2i|C'j . Note that 

hm -Vi^(e2^4|C^) = lim / 
1=1 J \ i=l ^-^ 



Thus, as {n,T)g^^ oo, we have 



11" ^ ^ 



i=l i=l 

which is a mixed normal. The above can be rewritten as 



lim ^E^^i^ (/ QiQ'i\C^ 



1/2 



X Z 



-^^y^y^xuUit^MN fo, lim ^y2^u^E( [ Q^Q[\c]]. 
This proves part (b). ■ 

The proofs for Propositions [1] and [2] (with observable F) follow immediately from Lemma lA.ll 
Propositions 3 and 4 are proved in the supplementary appendix of Bai et al. (2006). 

To derive the limiting distribution for Pcup^ need the following lemma. Hereafter, we define 
6nT = min{^,r} . 



Lemma A. 2 Suppose Assumptions 1-5 hold. Let Zi = MpoXi — -Ylk=i^^Fo^k(iik- Then as 
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(a) 

1 " 1 " / r \ 

i=l i=l ^"^ ^ 

//-Uj is uncorrelated with {xi,F^) for all i, then 

1 " / 1 " / /" 

E ^^"^ ^ MiV 0, Jim - E ^u^E { / 

(c) If Ui is possibly correlated with {xi,F^), then 

* i=l \ i=l ^ 

w;/tere 

1 " 
n 

k=l 

a-ik = A- (^A'A/n^ Afe, 

= B^i—(^j B^iB^ (^j Br^B^ Bjj 

wii/t Si = an(i Xi = Xi - ^ ^k=i ^kO-ik- 

Proof of (a). Recall 

Mpoxi = Xi-F^ (^F°'f°) ~^ F^'xi = Xi- F^Si 

where 

^^ = " = ' ^ E ^ (/ 5.^;) " / Br,B:, = 

IS ek r X k matrix as T — > oo. Write Xi = MpoXi = Xi — F^6i, aT x k matrix. Hence 



1 

Zi = MpoXi y^^MpoXkUik 

fe=i 

^ n 1 
= {xi - F°6i) E (^'fe ~ ^^^'') ^ik = Xi E 



n ^ — ' n — ' 

k=l k=l 
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where Oik = Aj ( A'A/n) Afc is a scalar and 



T~ Vf 'Vt^ 

a k X 1 vector, as T ^ oo. It follows that 

and using similar steps in part (a) in Lemma I A. II as n — > cx), 



— Qi 



Hence 



i=l i=\ ^ 



\-^Y^Z\z,^ Jim i Y.e{\ 



as {n,T)g^^ oo, showing (a). 
Proof of part (b). Notice that 



—=- V Z-Ui = —=- V Mpoxi - - V MpoXka 

^ n 1 " / 1 " 



= h + Ih. 
Ih is proved in Lemma [A.ll as {n,T)seq oo, 



if and liji are uncorrelated. Similarly, for lib, we have 

V (- V aifcM^oXfc) ni ^ MiV 0, lim - V Q^iE {Cni\C) 
^ 1=1 k=i \ i=i / 

where Cni = ^ Y!k=i ^ik I QkQ'k we have used the fact that ^ Ylk=i YTj=i ^^ikaij = ^ Ylk=i ^ik- 
Thus both If) and Ilf, have a proper limiting distribution. These distributions are dependent since 
they depend on the same Ui. We can also derive their joint limiting distribution. Given the form 
of Zi, it is easy to show that the above convergences imply part (b). 
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Proof of part (c). Now suppose xn and Uit are correlated. It is known that 



1 ^ 

- ^ XitUit 



t=i 



1 ^ ^ T 

t=l t=l 

T 

[/ 



BprdB., 



'<)( 



5^ dBy^i 



A, 



A, 
A, 



^(4 -<) 
= I QidBui + ( 4 -TT- ) 1 A 
as r — 00 (e.g., Phillips and Durlauf, 1986). First we note 



= J QidBu.bi + J QidBi^^Q^^^Qbui 



such that 



E 



I 



QidVi 



E 
E 



QidVi 



= 0. 



Note that 



^x'iMpo ( Axi AF ) n^^n^ui 
^x'i ( Axi AF ) n-^n^ui 

(4 ~<^j ) r ^ ( F° ) ^w^^''"^ ( 



(4 -vrl) 



Br, 



Therefore 



1 / 

T 



1 / 
— X, 



x-Mf ( Axi AF ) + ( 4 -S'i ) [Ab^i - AbiO^/ 

Mpo ( Axi AF ) n-^n^ui + ( 4 -s'i ) A+ 
/" r f 1^^^ 



27 



where 



Let 



^bui — ^bui ^bi^ifi ^bui- 



i=l '- 

Then we use similar steps in part (b) in Lemma lA.ll to get 

^ n ^ n T 

MN ^0, QiQ'i\C 
as in,T)^^g oo. 

Note Zi = Xi — ^ Ylk=i ^kO-ik is a demeaned Xi where ^ Z^fc=i Xkaik is the weighted average of 
Xi with the weight Oj^. It fohows that 



Xi / Xf^CLil; 

n 

k=l 



[Xi 



Xi 



\ n 

Xi - F% 

1 1 I. _ 1 V^n 



n E' 

fc=i 

- ^ (^fe - F^h) aik 

k=l 

- ^ Xkaik ] - F° iSi - - ^ 6kaik J 
" k=i J y n j 



where Xi = Xi- - YJk=\ ^kO-ik and 6i = 5i - - YJk=i ho-ik- 
We then can modify psp as 



^ T 1 



r 



- 6iF^ ] Uit 



tJ2( ^k 



t=i 



Xit 



Uit 



-7T- 



Ik -Si JrY^ 



t=l 



Xit 



Uit 



+ 



A, 



j RnidBui + ( h -vr- ) ^ ^""^ ^ 



(27) 



where 5^, = B^i - ^ I]fc=i -Beiajfc and 

1 " 

= (^j /^haik 



k=l 



-1 



BriB^i = TTi. 
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The Rni terms appears in the last hne in ()27p because 



Bei - T^iBr, 



Let 



Clearly 



1 " 

n ^-^ 

k=l 



k^ik 



B^B^ 



-1 



B,^ Bei 



1 " 
n ^-^ 

k=l 



k'^ik I Bfj 



B, 



j B^B^ j B^B^i B^ - -Y^^I^Bsk - (^j Br^B^ j 



1 " 

Qr--y^Qi 



k=l 



1 " 

-E 



n 



k^ik — Rni- 



1 „, 



hui 



BrjB^,. 



Br, 



1 " 



^E ---E 



fc=l 



as (n, T ^ oo) with Rni = Qi - \ Ylk=i QkO-ik- This proves (c). ■ 
Proof of Theorem [Tl 

This follows directly from Lemma I A. 21 as {n,T) — > oo when ^ ^ 
V^T (Pcup - /?) - V^(l>nT MN (^0, D^' ^lim i n^^E {^j RniR^ni\C^ 



D 



-1 



nT' 



-1 



where Dz = Jl™^^ I]"=i ( / -Rm-^nJC'j and (p^r 

Proof of Theorem 2 and 3. The proof for Theorem 2 is similar to that of Theorem 3 
below, thus omitted. To prove Theorem 3, we need some preliminary results. First we examine 
the limiting distribution of the infeasible FM estimator, PcupFM- The endogeneity correction is 
achieved by modifying the variable ya in ([3]) with the transformation 



ytt = Vit - ^ubi^u 



afo 



and 



+ - o o-i ( 
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By construction has zero long-run covariance with ^ Ax AF^ ' j and hence the endo- 
geneity can be removed. The serial correlation correction term has the form 

h 



_eui 



Q — 1q 

bi bui 



where Ab^jj denotes the one-sided long-run covariance between uu and {eit,r]^). Therefore, the 
infeasible FM estimator is 

/ n \ ~1 n 

PcupFM = Yl ^'iMpoXi Yl U^Foyt - T (A+ . - ~5\K+u 



i=l 



pO'jpO] ^ pO'^ 



with 5i 

The following Lemma gives the limiting distribution of PcupFM- 
Lemma A. 3 Suppose Assumptions in TheoremUlhold. Then as (n, T)^^^ oo 



MN 0,Dz^ 



lim -S2^ume( I RniR'nilC 
i=l 



Proof. Let w2 



where 



uf^ T] ) and we have 



[Tr] 



B^ 

Be 

Br, 



Bei 
Br, 

^u.bi 

n^i 



Bbi 



BM{nf) asT^oo, 



n 



u.bi 







^erii 
^riei 







Bu 





i;+ + r+ + r+', 

I 





Bui 




Bbi 



(28) 
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Define = T,f + Tf. and let uf^^ = uu — O^bjOj^j^ ^ AF* ) ' ^^^'^^ notice from 
|A2]tliat 



EBl) in Lemma 



liT 



T T 

t=l t=l ^ * ^ 



Ik -d[ 



t=l 



nit 



Xit 



^ImJ g.d^. + (a+ , - ^:a+ 



(29) 



as T ^ cxD. Now let 



Clearly, 



Thus. 



liT ~ [ ^eui ~ ^i\u I ■ 



Cut = C 



1 " 

-j=^ {x[Mpou+ - T (a+ , - ,5;A+ 

^ i=l 



as {n,T) — > oo. Next, we modify ([29]). 



7=^ ^ - T (a+ , - a+ ) ^ 



1 ^ 



i=l 



4 -I. 



t=i 



h -vf • ) 



/ 



Be 

Br 



^0 I ''^ubi^'^bi \ ^pO 



B 
B 



RnidBui + { Ik — 
1/2 



A 

A. 



■eul 



Km / i?„^'il4+(A+, -vf,;A, 



"qu 

'eui ^ "i^'rju 



J RnidB^^Q.^^,^Vti,ui + ( h — ) 



Therefore, 



-|^|;(z:<-r(A+,-^A+)) ^MiV (^0, Y.mJ-jyi^ME l^j 
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as (n, r) — oo. Then 



MN 



as (n, — oo. This proves the theorem. ■ 

To show \/riT [PcupFM ~ PcupFM^ = Op (1), we need the following lemma. 

Lemma A. 4 Under Assumptions of 1-6, we have 

(a) ^ (a+ „ - A+ „) = Op (1) , 

(h) ^ELi (<A+ -<A+)=Op(l), 



(c) E"=i (x[Mpuf - x'iMpou^ 



where = - ^ubi^u 



A+ — i V" A+ nnrl A+ — i V" A+ 



Note that the lemma holds when the long run variances are replaced by the bar versions. Since 
the proofs are basically the same (as demonstrated in the proof of Theorem 1), the proof is focused 
on the variances without the bar. 
Proof. First, note that 



Then 



A+ — A • — A ■n*~-^n • 



where is the first k x k block of Following the arguments as in the proofs of Theorems 
9 and 10 of Hannan (1970) (also see similar result of Moon and Perron (2004)), we have 

2 



E 

< supE 



(A+ n - A+ „ 



A+ . - EA+ ■ 

eut em 



+ nsup 

i 



EA+ . - A+ . 

em em 



It follows that 



{^tun - ^tun) = Op ^max 
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From Assumption [6l K ^ n^. Then 



n n 



if 1 < 2qb or ^ < 6. Next 

.6 / /^b 



K ( fn'\\ ( \ogT\ b-'^ . b-liminflH£Z 

— >^ — = exp log — = exp log n = n " < n log n ^ y 

T T V \T J J V logny 

if 6 < lim inf by Assumption [6l Then 



Vni'^tun- ^tun] = Op ( max ^ " 




Opil) 



as required. This proves (a). 
To estabhsh (b), we note 



1 " / 1 " \ ^ 

i=i \ i=i / 

= Op(l)Op(^max|y|,y^|^ =Op(l) 

as required for part (b). 

Let uf^ = Uit - ^ubi^bi ^ • 

1 " 

i=l 
1 " 

= 7^ (^i^F^*^ - ^'i^F^t + - x;Mpn+ + x'^Mput - x-Mpoul^ 

i=l 

^ n ^71 1 " 

= 7^ E (^i^F^f.^ - + 7^ E (^i^^i?^^ - ^rMp^ut) +-7=^Y. {^'^Mput - x[Mput) 

^ i=l ^ i=l ^ i=l 

^ n 1 " 1 " 

= 7^ E ^'^^F {"i^t - O + 7^ E {^'i^F - ^'^^Fo) < + 7=^ E ^^^i? ("^t - ^t) 

^ i=l ^ i=l ^ i=l 

^ n 1 " 1 " 

= 7^ E ^^^F (s.^ - <) + 7^ E ^U^F - < + 7^ E ^^^F (s.'' - ^'') 

j=l j=l i=l 

= I + 11 + III. 

From the proof of Proposition H] in the supplementary appendix, 

x; {Mp - Mpo) uf = Op (1) 



// = 

i=l 
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if we replace Ui by uf. Let Abi = ( Axi AF ) be a T x (/c + r) matrix. Consider /. 

^ i=l ^ 1=1 

1 

^ 1=1 

\ ^ ( ff' \ 

1 " 
1=1 

i=l 

= Ic + IIc- 

Along the same lines as the proofs of Theorems 9 and 10 of Hannan (1970), we can show that 



sup£^ 

i 



^ubi^bi ^ubi^bi 



Then we have 



^ubi^bi^ - ^ubi^bi^ = Op ^Max I 



K / 1 



and 



1 " II I 2 1 " II 

yn II 71 II 

1=1 j=l 



Op Max ./-, 



K / 1 



For /c, by the Cauchy Schwarz inequality, 



1 " 



< Vn 



1 " 



1/2 



\^ i=l 

< [OAV^)f\V^f'Op(MaxS^^,^yj 



1/2 



IK I 1 
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Similarly, 
WILW 



1 ,i±' 



,FF 

^ i=l 

1 ^x'.FF'AL 



i=l 



^^ubi^^'bi ^^ubi^^'bi 



< 



n ^ 



i=l 



x'.FF'Abi 



rp2 



1/2 



1 II ^ ^ 

\ ^ i=l 



1/2 



K / 1 



Combining Jc. and ^^c, we have 
1 " 



Op Max 



nK / n 



Recall K ^ and liminf > 1 from Assumption [H It follows that, as in Moon and Perron 
(2004) 



nK 



n 



b+l 



T 



exp ( log 



n 



b+l 



6+1-1 ^ 

n < n 



T 

6+1— lim inf 



exp + 1 — log n 



logn 



log n — y Q 



by Assumption [6] and b < lim inf — 1 . Also note 



n n 



J^2q ^2qb 



n 



(l-2g6) 



by Assumption [6] and ^ < b. Therefore 



1 " 

^ j=i 



Let 



Note that 



Abi = Axi AF 



Op{l). 



Ah: - Abi = ( Axi AF 



Axi AF 



AF - AF 
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Consider III. 



^ i=i ^ i=i 

1 " 

i=l 
1 " 

i=l 

We use Lemma 12.3 in Bai (2005) to get 



nT 

It follows that 



^|:.:Mp(aF-Af) = v^[o,(3-/3»)+0,(-J^) 



since y — > as (n, T) — > oo. Collecting / — /// we prove (c). ■ 
Proposition A. 1 Under Assumptions\T^ 

[f^CupFM ~ PcupFM ) = Op (1) • 



Proof. To save the notations, we only show that results with Xi in place of Xi and Si in place of 
of 6i since the steps are basically the same. In the supplementary appendix, it is shown that (see 
the proof of Proposition S]) 



Then 



Xi] +0p (1) 



yPcupFM ~ PcupFM 

( 1 V- 'M V 1 / sr=i(^-;MFS+-T(A+„,-a,%, . 
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where A+„ = i Y.i=i '^tm andA+ „ = i Ya=i ^tm- Finally using Lemma E31 

{j^CupFM ~ f^CupFM^ = Op (1) • 

■ 

Proof of Theorem [Sj This follows directly from Proposition lA.li ■ 
Proof of Proposition [5} In the supplementary appendix, it is shown that 

T 

t=i 

From y/nT{[3 — P^) = Op(l), the first term on the right hand side is Op(l/(nT)), which is dominated 
byO(l/n). ■ 
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Table 1: Mean Bias and Standard Deviation of Estimators 



LSDV 



(T31 = 
2sFM CupBC 



CupFM 



LSDV 



(731 = 0.8 

2sFM CupBC 



CupFM 



LSDV 



<731 

2sFM 



= -0.8 

CupBC CupFM 



0'21 

n,T= 
n,T= 
n,T= 
n,T= 

0'21 = 

n,T= 
n,T= 
n,T= 
n,T= 

0"21 = 

n,T= 
n,T= 
n,T= 
n,T= 



= 
=20 

=40 

=60 

120 

0.2 

=20 

=40 

=60 

120 

-0.2 
=20 

=40 

=60 

120 



1.352 

1.559) 

3.371 

1.139) 

■2.006 

0.920) 

0.204 

0.645) 

4.333 

1.584) 

4.567 

1.133) 

-1.100 

0.923) 

0.696 

0.648) 

-1.600 

1.588) 

2.086 

1.144) 

-2.850 

0.917) 

-0.501 

0.650) 



0.349 

0.387) 

0.719 

0.225) 

0.094 

0.138) 

-0.064 

0.056) 

0.317 

0.385) 

-0.768 

0.223) 

0.109 

0.138) 

0.059 

0.055) 

0.376 

0.393) 

-0.653 

0.225) 

0.008 

0.142) 

0.000 

0.057) 



0.030 

(0.030) 
-0.000 
(0.009) 
-0.000 

(0.005) 
-0.000 
(0.018) 

-0.119 
(0.030) 
-0.113 
(0.010) 
-0.071 
(0.005) 
0.000 
0.018 

0.179 
(0.031) 

0.105 
(0.010) 

0.055 
(0.005) 

0.000 
(0.002) 



0.030 

0.029) 
-0.000 
0.009) 
-0.000 
0.005) 
-0.000 
0.002) 

0.332 

0.029) 

0.100 

0.009) 

0.045 

0.005) 

0.178 

0.002) 

-0.274 

0.029) 

-0.108 

0.009) 

-0.062 

0.005) 

0.000 

0.018) 



-0.712 

1.505) 
2.761 
1.529) 
-1.393 
0.915) 
0.548 
0.646) 

2.258 

1.529) 

4.051 

1.153) 

-0.337 

0.925) 

1.161 

0.649) 

-3.763 

1.593) 

0.812 

1.141) 

-2.178 

0.905) 

-0.175 

0.646) 



0.257 

0.372) 

-0.246 

0.227) 

0.038 

0.139) 

-0.062 

0.056) 

0.129 

0.382) 

-0.333 

0.227) 

0.082 

0.139) 

-0.070 

0.055) 

0.331 

0.345) 

-0.077 

0.223) 

-0.018 

0.136) 

-0.000 

0.057) 



0.000 

(0.030) 
-0.000 
(0.010) 
-0.000 
(0.005) 
-0.020 
(0.002) 

-0.158 
(0.031) 
-0.117 
(0.010) 
-0.067 
(0.005) 
-0.017 
(0.002) 

0.151 
(0.031) 

0.101 
(0.010) 

0.058 
(0.005) 
-0.018 
(0.002) 



0.000 

(0.029) 
-0.000 
(0.009) 
-0.000 
(0.005) 
0.015 
(0.002) 

0.293 
(0.029) 

0.101 
(0.009) 

0.049 
(0.005) 

0.017 
(0.002) 

-0.291 
(0.029) 
-0.113 
(0.009) 
-0.058 
(0.005) 
0.017 
(0.002) 



2.216 

1.524) 

1.010 

1.124) 

■1.073 

0.929) 

-0.163 

0.643) 

4.903 

1.614) 

1.964 

1.120) 

0.032 

0.938) 

0.151 

0.646) 

-0.754 
1.603) 
■0.353 
1.128) 
-1.872 
0.921) 
■0.839 
0.654) 



■0.086 

0.394) 

-0.371 

0.217) 

0.199 

0.138) 

-0.061 

0.056) 

-0.220 

0.396) 

-0.376 

0.216) 

0.150 

0.140) 

-0.026 

0.055) 

-0.049 

0.394) 

■0.313 

0.218) 

0.236 

0.138) 

0.029 

0.058) 



0.030 

0.029) 

-0.000 

0.009) 

-0.000 

0.005) 

0.018 

0.002) 

-0.117 

0.030) 

-0.115 

0.010) 

-0.065 

0.005) 

0.017 

0.002) 

0.169 

0.031) 

0.096 

0.010) 

0.056 

0.005) 

0.000 

0.002) 



0.030 

0.029) 
-0.000 
0.009) 
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0.005) 
-0.000 
0.002) 

0.322 

0.028) 

0.102 

0.009) 

0.051 

0.005) 

-0.017 

0.002) 

-0.274 
0.029) 
■0.112 
0.009) 
-0.060 
0.005) 
■0.000 
0.002) 



Note: (a) The Mean biases here have been multip^ 
(b) C = 5, C732 = 0.4. 



led by 100. 



Table 2: Mean Bias and Standard Deviation 
of Estimators for Different n and T 



c = 5 


c = 10 


(n,T) 


LSDV 


2sFM 


CupBC 


CupFM 


LSDV 


2sFM 


CupBC 


CupFM 


(20, 20) 


2.258 


0.129 


-0.158 


0.293 


1.538 


0.275 


-0.158 


0.294 




(1.594) 


(0.382) 


(0.031) 


(0.028) 


(3.186) 


(0.771) 


(0.031) 


(0.029) 


(20,40) 


4.832 


-0.426 


-0.067 


0.107 


8.141 


-0.006 


-0.067 


0.106 




(1.692) 


(0.288) 


(0.014) 


(0.014) 


(3.186) 


(0.566) 


(0.014) 


(0.014) 


(20, 60) 


0.460 


0.282 


-0.019 


-0.058 


-0.105 


0.0561 


-0.186 


0.058 




(1.560) 


(0.206) 


(0.009) 


(0.009) 


(3.121) 


(0.412) 


(0.009) 


(0.009) 


(20, 120) 


3.018 


0.040 


0.010 


0.021 


-6.550 


0.067 


0.010 


0.021 




(1.572) 


(0.123) 


(0.005 


(0.005) 


(3.144) 


(0.245) 


(0.005) 


(0.004) 


(40, 20) 


4.012 


-0.566 


-0.225 


0.320 


5.092 


-1.087 


-0.226 


0.320 




(1.126) 


(0.280) 


(0.0218) 


(0.019) 


(2.252) 


(0.593) 


(0.021) 


(0.019) 


(40,40) 


4.051 


-0.332 


-0.117 


0.101 


6.616 


-0.622 


-0.117 


0.101 




(1.153) 


(0.227) 


(0.010) 


(0.009) 


(2.305) 


(0.454) 


(0.010) 


(0.009) 


(40, 60) 


1.818 


0.114 


-0.055 


0.051 


2.628 


0.248 


-0.055 


0.051 




(1.098) 


(0.158) 


(0.007) 


(0.006) 


(2.196) 


(0.317) 


(0.007) 


(0.006) 


(40, 120) 


1.905 


-0.090 


-0.010 


0.015 


3.303 


-0.178 


-0.010 


0.015 




(1.111) 


(0.087) 


(0.003) 


(0.003) 


(2.243) 


(0.187) 


(0.003) 


(0.003) 


(60, 20) 


3.934 


-0.317 


-0.294 


0.295 


4.989 


-0.544 


-0.294 


0.295 




(0.921) 


(0.249) 


(0.018) 


(0.017) 


(1.841) 


(0.497) 


(0.014) 


(0.016) 


(60,40) 


2.023 


0.110 


-0.125 


0.108 


2.573 


0.267 


-0.125 


0.109 




(0.923) 


(0.187) 


(0.009) 


(0.008) 


(1.296) 


(0.027) 


(0.009) 


(0.008) 


(60,60) 


-0.337 


0.082 


-0.067 


0.049 


-1.666 


0.191 


-0.067 


0.049 




(0.925) 


(0.139) 


(0.005) 


(0.005) 


(1.850) 


(0.279) 


(0.005) 


(0.005) 


(60, 120) 


-1.168 


0.109 


-0.015 


0.015 


-2.839 


-0.223 


-0.014 


0.015 




[O.yzd ) 


[U.Ulo) 


(U.UOo j 


(U.UOoj 


(l.o4 ( j 


(U.ioij 


(O.OUoj 


(O.OUoj 


(120, 20) 


2.548 


-0.151 


-0.304 


0.294 


2.236 


-0.203 


-0.304 


0.294 




(0.651) 


(0.182) 


(0.014) 


(0.011) 


(1.303) 


(0.362) 


(0.014) 


(0.011) 


(120,40) 


1.579 


-0.026 


-0.013 


0.001 


1.678 


0.000 


-0.133 


0.112 




(0.661) 


(0.137) 


(0.006) 


(0.005) 


(1.321) 


(0.279) 


(0.006) 


(0.005) 


(120,60) 


0.764 


0.004 


-0.077 


0.013 


0.539 


0.061 


-0.077 


0.048 




(0.634) 


(0.100) 


(0.004) 


(0.004) 


(1.267) 


(0.199) 


(0.004) 


(0.004) 


(120, 120) 


1.161 


-0.070 


-0.017 


0.017 


1.823 


-0.134 


-0.017 


0.018 




(0.649) 


(0.055) 


(0.002) 


(0.002) 


(1.298) 


(0.111) 


(0.002) 


(0.002) 



(a) The Mean biases here have been multipHed by 100. 



(b) (721 = 0.2, (731 = 0.8, and (732 = 0.4. 
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Table 3: Mean Bias and Standard Deviation of t-statistics 



LSDV 



0-31 = 

2sFM CupBC CupFM 



0-31 = 0.8 (731 = -0.8 

LSDV 2sFM CupBC CupFM LSDV 2sFM CupBC CupFM 



(721 = 
n,T=20 

n,T=40 

n,T=60 

n,T=120 

(721 = 0.2 
n,T=20 

n,T=40 

n,T=60 

n,T=120 

(721 = -0.2 

n,T=20 
n,T=40 
n,T=60 
n,T=120 



0.036 
(2.414) 

0.092 
(3.576) 
-0.098 
(4.346) 

0.046 
(6.093) 

0.104 
(2.508) 

0.149 
(3.563) 
-0.032 
(4.357) 

0.049 
(6.060) 

-0.031 
(2.519) 
0.033 
(3.586) 
-0.162 
(4.335) 
-0.066 
(6.098) 



0.006 

2.445) 

-0.036 

2.589) 

0.016 

2.647) 

-0.019 

2.696) 

0.040 

2.454) 

-0.013 

2.597) 

0.039 

2.651) 

-0.016 

2.640) 

-0.013 

2.456) 

■0.068 

2.593) 

0.002 

2.657) 

0.001 

2.679) 



0.016 

1.531) 
-0.007 
1.276) 
-0.019 
1.182) 
-0.003 
1.101) 

0.001 

1.558) 

-0.081 

1.304) 

-0.100 

1.209) 

0.003 

1.096) 

0.029 

1.559) 

0.067 

1.312) 

0.062 

1.212) 

0.007 

1.106) 



0.016 

1.502) 
-0.006 
1.256) 
-0.019 
1.169) 
-0.003 
1.096) 

0.185 

1.497) 

0.140 

1.252) 

0.115 

1.167) 

0.002 

1.092) 

-0.155 

1.497) 

-0.153 

1.255) 

-0.154 

1.169) 

0.007 

1.106) 



0.006 

(2.527) 
0.074 
(3.592) 
-0.036 

(4.325) 
0.099 
(6.089) 

0.070 
(2.529) 

0.134 
(3.578) 

0.027 
(4.357) 

0.097 
(6.084) 

-0.064 
(2.528) 
-0.005 
(3.597) 
-0.093 
(4.283) 
-0.010 
(6.152) 



0.0224 

2.449) 
-0.052 
2.618) 
-0.016 
2.640) 
-0.019 
2.661) 

0.037 

2.453) 

-0.022 

2.639) 

0.013 

2.647) 

-0.019 

2.645) 

0.005 

2.439) 

-0.071 

2.618) 

-0.035 

2.633) 

0.022 

2.577) 



0.001 

1.529) 
-0.012 
1.273) 
-0.011 
1.189) 
-0.075 
1.118) 

-0.013 
1.561) 
-0.085 
1.307) 
-0.094 
1.215) 
-0.059 
1.115) 

0.125 

1.556) 

0.061 

1.305) 

0.067 

1.210) 

-0.062 

1.116) 



0.001 

1.503) 

-0.011 

1.254) 

-0.011 

1.178) 

0.102 

1.094) 

0.188 

1.442) 

0.142 

1.252) 

0.123 

1.174) 

0.114 

1.093) 

-0.166 

1.498) 

-0.162 

1.248) 

-0.146 

1.168) 

0.117 

1.092) 



0.041 

(2.534) 
0.019 
(3.588) 
-0.060 
(4.315) 
-0.088 
(6.095) 

0.105 
(2.539) 

0.059 
(3.578) 

0.011 
(4.325) 

0.012 
(6.043) 

-0.031 
(2.538) 
-0.035 
(3.588) 
-0.114 
(4.308) 
-0.111 
(6.119) 



-0.001 

2.455) 

0.008 

2.581) 

0.045 

2.644) 

-0.040 

2.705) 

0.033 

2.465) 

-0.003 

2.612) 

0.038 

2.646) 

-0.029 

2.635) 

-0.029 

2.458) 

-0.021 

2.574) 

0.028 

2.643) 

-0.004 

2.691) 



0.019 

1.515) 

-0.006 

1.278) 

■0.009 

1.182) 

0.068 

1.120) 

0.004 

1.543) 

-0.081 

1.314) 

-0.087 

1.204) 

0.062 

1.111) 

0.027 

1.556) 

0.058 

1.305) 

0.067 

1.206) 

0.077 

1.125) 



0.019 

1.491) 
-0.005 
1.217) 
-0.009 
1.169) 
-0.011 
1.095) 

0.181 

1.483) 

0.143 

1.258) 

0.127 

1.162) 

-0.109 

1.089) 

-0.152 
1.498) 
-0.159 
1.252) 
-0.147 
1.166) 
-0.104 
1.101) 



Note: (a) c = 5, (732 = 0.4. 



Table 4: Mean Bias and Standard Deviation 
of t-statistics for Different n and T 



c = 5 


c= 10 




LSDV 


2sFM 


CupBC 


CupFM 


LSDV 


2sFM 


CupBC 


CupFM 


(20, 20) 


0.070 


0.037 


-0.013 


0.169 


0.036 


0.030 


-0.013 





169 




(2.529) 


(2.453) 


(1.561) 


(1.497) 


(2.532) 


(2.562) 


(1.560) 


(1 


496) 


(20, 40) 


0.130 


-0.007 


-0.009 


0.110 


0.106 


-0.011 


-0.009 





110 




(3.539) 


(1.863) 


(1.313) 


(1.286) 


(3.541) 


(1.896) 


(1.313) 


(1 


286) 


(20, 60) 


0.029 


0.009 


0.015 


0.085 


0.009 


0.003 


0.016 





085 




(4.303) 


(1.553) 


(1.253) 


(1.239) 


(4.305) 


(1.569) 


(1.253) 


(1 


239) 


(20, 120) 


-0.090 


0.015 


0.057 


0.064 


-0.105 


0.013 


0.057 





064 




(6.131) 


(1.222) 


(1.156) 


(1.151) 


(6.132) 


(1.220) 


(1.156) 


(1 


151) 


(40, 20) 


0.119 


-0.015 


-0.086 


0.242 


0.073 


-0.019 


-0.086 





241 




(2.518) 


(3.376) 


(1.549) 


(1.443) 


(2.520) 


(3.610) 


(1.549) 


(1 


443) 


(40, 40) 


0.134 


-0.022 


-0.085 


0.142 


0.100 


-0.026 


-0.085 





142 




(3.578) 


(2.639) 


(1.307) 


(1.252) 


(3.580) 


(2.739) 


(1.307) 


(1 


252) 


(40, 60) 


0.113 


0.012 


-0.048 


0.109 


0.085 


0.008 


-0.047 





109 




(4.328) 


(2.164) 


(1.209) 


(1.177) 


(4.329) 


(2.222) 


(1.209) 


(1 


176) 


(40, 120) 


0.133 


-0.014 


-0.007 


0.059 


0.113 


-0.019 


-0.007 





059 




(6.097) 


(1.519) 


(1.131) 


(1.123) 


(6.098) 


(1.535) 


(1.131) 


(1 


123) 


(60, 20) 


0.123 


0.005 


-0.161 


0.276 


0.067 


-0.002 


-0.160 





276 




(2.521) 


(4.042) 


(1.579) 


(1.424) 


(2.524) 


(4.409) 


(1.579) 


(1 


425) 


(60, 40) 


0.100 


0.069 


-0.109 


0.192 


0.059 


0.065 


-0.109 





192 




(3.532) 


(3.206) 


(1.352) 


(1.272) 


(3.534) 


(3.375) 


(1.352) 


(1 


272) 


(60, 60) 


0.027 


0.013 


-0.094 


0.123 


-0.006 


0.010 


-0.094 





122 




(4.426) 


(2.613) 


(1.215) 


(1.174) 


(4.359) 


(2.751) 


(1.215) 


(1 


174) 


(60, 120) 


-0.020 


0.031 


-0.024 


0.077 


-0.044 


0.030 


-0.025 





077 




(6.131) 


(1.866) 


(1.118) 


(1.104) 


(6.132) 


(1.902) 


(1.118) 


(1 


104) 


(120,20) 


0.139 


0.044 


-0.243 


0.386 


0.060 


0.063 


-0.243 





386 




(2.478) 


(5.269) 


(1.681) 


(1.404) 


(2.479) 


(5.969) 


(1.681) 


(1 


404) 


(120,40) 


0.135 


0.037 


-0.186 


0.268 


0.078 


0.040 


-0.186 





268 




(3.588) 


(4.369) 


(1.366) 


(1.233) 


(3.589) 


(4.706) 


(1.366) 


(1 


233) 


(120,60) 


0.099 


0.011 


-0.162 


0.174 


0.052 


0.004 


-0.162 





174 




(4.272) 


(3.683) 


(1.249) 


(1.166) 


(4.273) 


(3.902) 


(1.249) 


(1 


167) 


(120,120) 


0.097 


-0.189 


-0.589 


0.114 


0.063 


-0.027 


-0.059 





114 




(6.084) 


(2.645) 


(1.115) 


(1.093) 


(6.086) 


(2.741) 


(1.115) 


(1 


093) 


(a) (721 = 


0.2, (731 = 


0.8, and 


(732 = 0.4. 
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